Evergreen or Ephemeral: Predicting Webpage Longevity Through Relevancy Features

Authors

  • Elaine Zhou
  • Lingtong Sun
Abstract

With the rapid proliferation of user-generated content on the Internet, one of the biggest challenges is determining the relevancy of the information shown. Content generally falls into two camps: ephemeral or evergreen. Evergreen content, such as a recipe for carrot cake or an introduction to data structures, rarely changes with time, whereas ephemeral content, such as celebrity hot-or-not trends or local high school sports scores, quickly becomes dated. Unlike apps built around ephemerality, such as Snapchat, the Internet doesn't have the luxury of assigning expiration dates to content. Humans can easily distinguish one kind from the other, but machines have yet to do so.

Similar resources

Classifying Ephemeral vs Evergreen Content on the Web

One of the strengths of the internet is the proliferation of content available on virtually any topic imaginable. The challenge today has become sorting through this wealth of content to locate the information of greatest interest to each user. Many sites today implement recommender engines based on expressed and learned user preferences to direct users towards new content that the engine belie...

Learning to Analyze Relevancy and Polarity of Tweets

This paper describes the participation of Oxyme in the profiling task of the RepLab workshop. We use a machine learning approach to predict the relevancy and polarity for reputation. The same classifier is used for both tasks. Features used include query dependent features, relevancy features, tweet features and sentiment features. An important component of the relevancy features are manually p...

A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection

Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...

A Cost-Benefit Analysis of Leaf Habit and Leaf Longevity of Trees and Their Geographical Pattern

To maximize net gain of a tree, leaves must be replaced when net gain of a leaf per unit time over the leaf's life span is maximum. A model in which leaf longevity is determined to maximize the net gain of a leaf per unit time is constructed. The model predicts that leaf longevity is short when initial net photosynthetic rate of the leaf is large, long when the construction cost of the leaf is...

Evaluating Query-Independent Object Features for Relevancy Prediction

This paper presents a series of experiments investigating the effectiveness of query-independent features extracted from retrieved objects to predict relevancy. Features were grouped into a set of conceptual categories, and individually evaluated based on click-through data collected in a laboratory-setting user study. The results showed that while textual and visual features were useful for re...

Publication date: 2014